64 research outputs found

    Assessing Steady-State, Multivariate Experimental Data Using Gaussian Processes: The GPExp Open-Source Library

    Get PDF
    Experimental data are subject to different sources of disturbance and errors, whose importance should be assessed. The level of noise, the presence of outliers or a measure of the “explainability” of the key variables with respect to the externally-imposed operating condition are important indicators, but are not straightforward to obtain, especially if the data are sparse and multivariate. This paper proposes a methodology and a suite of tools implementing Gaussian processes for quality assessment of steady-state experimental data. The aim of the proposed tool is to: (1) provide a smooth (de-noised) multivariate operating map of the measured variable with respect to the inputs; (2) determine which inputs are relevant to predict a selected output; (3) provide a sensitivity analysis of the measured variables with respect to the inputs; (4) provide a measure of the accuracy (confidence intervals) for the prediction of the data; (5) detect the observations that are likely to be outliers. We show that Gaussian processes regression provides insightful numerical indicators for these purposes and that the obtained performance is higher or comparable to alternative modeling techniques. Finally, the datasets and tools developed in this work are provided within the GPExp open-source package

    Mapping human temporal and parietal neuronal population activity and functional coupling during mathematical cognition

    Get PDF
    Brain areas within the lateral parietal cortex (LPC) and ventral temporal cortex (VTC) have been shown to code for abstract quantity representations and for symbolic numerical representations, respectively. To explore the fast dynamics of activity within each region and the interaction between them, we used electrocorticography recordings from 16 neurosurgical subjects implanted with grids of electrodes over these two regions and tracked the activity within and between the regions as subjects performed three different numerical tasks. Although our results reconfirm the presence of math-selective hubs within the VTC and LPC, we report here a remarkable heterogeneity of neural responses within each region at both millimeter and millisecond scales. Moreover, we show that the heterogeneity of response profiles within each hub mirrors the distinct patterns of functional coupling between them. Our results support the existence of multiple bidirectional functional loops operating between discrete populations of neurons within the VTC and LPC during the visual processing of numerals and the performance of arithmetic functions. These findings reveal information about the dynamics of numerical processing in the brain and also provide insight into the fine-grained functional architecture and connectivity within the human brain

    Predicting numerical processing in naturalistic settings from controlled experimental conditions

    Get PDF
    Machine learning research is interested in building models based on a training set that can then be applied to new data, whether this unseen data comes from new examples (e.g. New subjects, other tasks) or new features (e.g. Different modalities). In this work, we present a simple approach to transfer learning using intracranial EEG (also known as electrocorticographic, ECoG) data from three patients. More specifically, we aimed at detecting numerical processing during naturalistic settings based on a model trained with controlled experimental conditions. Our results showed significant prediction accuracy of numerical events in naturalistic settings when considering a priori knowledge of the target task

    ABCD Neurocognitive Prediction Challenge 2019: Predicting individual fluid intelligence scores from structural MRI using probabilistic segmentation and kernel ridge regression

    Get PDF
    We applied several regression and deep learning methods to predict fluid intelligence scores from T1-weighted MRI scans as part of the ABCD Neurocognitive Prediction Challenge (ABCD-NP-Challenge) 2019. We used voxel intensities and probabilistic tissue-type labels derived from these as features to train the models. The best predictive performance (lowest mean-squared error) came from Kernel Ridge Regression (KRR; λ=10\lambda=10), which produced a mean-squared error of 69.7204 on the validation set and 92.1298 on the test set. This placed our group in the fifth position on the validation leader board and first place on the final (test) leader board.Comment: Winning entry in the ABCD Neurocognitive Prediction Challenge at MICCAI 2019. 7 pages plus references, 3 figures, 1 tabl

    ABCD Neurocognitive Prediction Challenge 2019: Predicting individual residual fluid intelligence scores from cortical grey matter morphology

    Get PDF
    We predicted residual fluid intelligence scores from T1-weighted MRI data available as part of the ABCD NP Challenge 2019, using morphological similarity of grey-matter regions across the cortex. Individual structural covariance networks (SCN) were abstracted into graph-theory metrics averaged over nodes across the brain and in data-driven communities/modules. Metrics included degree, path length, clustering coefficient, centrality, rich club coefficient, and small-worldness. These features derived from the training set were used to build various regression models for predicting residual fluid intelligence scores, with performance evaluated both using cross-validation within the training set and using the held-out validation set. Our predictions on the test set were generated with a support vector regression model trained on the training set. We found minimal improvement over predicting a zero residual fluid intelligence score across the sample population, implying that structural covariance networks calculated from T1-weighted MR imaging data provide little information about residual fluid intelligence.Comment: 8 pages plus references, 3 figures, 2 tables. Submission to the ABCD Neurocognitive Prediction Challenge at MICCAI 201

    Fast temporal dynamics and causal relevance of face processing in the human temporal cortex

    Get PDF
    We measured the fast temporal dynamics of face processing simultaneously across the human temporal cortex (TC) using intracranial recordings in eight participants. We found sites with selective responses to faces clustered in the ventral TC, which responded increasingly strongly to marine animal, bird, mammal, and human faces. Both face-selective and face-active but non-selective sites showed a posterior to anterior gradient in response time and selectivity. A sparse model focusing on information from the human face-selective sites performed as well as, or better than, anatomically distributed models when discriminating faces from non-faces stimuli. Additionally, we identified the posterior fusiform site (pFUS) as causally the most relevant node for inducing distortion of conscious face processing by direct electrical stimulation. These findings support anatomically discrete but temporally distributed response profiles in the human brain and provide a new common ground for unifying the seemingly contradictory modular and distributed modes of face processing

    Improving SNR and reducing training time of classifiers in large datasets via kernel averaging

    Get PDF
    Kernel methods are of growing importance in neuroscience research. As an elegant extension of linear methods, they are able to model complex non-linear relationships. However, since the kernel matrix grows with data size, the training of classifiers is computationally demanding in large datasets. Here, a technique developed for linear classifiers is extended to kernel methods: In linearly separable data, replacing sets of instances by their averages improves signal-to-noise ratio (SNR) and reduces data size. In kernel methods, data is linearly non-separable in input space, but linearly separable in the high-dimensional feature space that kernel methods implicitly operate in. It is shown that a classifier can be efficiently trained on instances averaged in feature space by averaging entries in the kernel matrix. Using artificial and publicly available data, it is shown that kernel averaging improves classification performance substantially and reduces training time, even in non-linearly separable data

    Adapting to Latent Subgroup Shifts via Concepts and Proxies

    Get PDF
    We address the problem of unsupervised domain adaptation when the source domain differs from the target domain because of a shift in the distribution of a latent subgroup. When this subgroup confounds all observed data, neither covariate shift nor label shift assumptions apply. We show that the optimal target predictor can be non-parametrically identified with the help of concept and proxy variables available only in the source domain, and unlabeled data from the target. The identification results are constructive, immediately suggesting an algorithm for estimating the optimal predictor in the target. For continuous observations, when this algorithm becomes impractical, we propose a latent variable model specific to the data generation process at hand. We show how the approach degrades as the size of the shift changes, and verify that it outperforms both covariate and label shift adjustment

    Multitask prediction of organ dysfunction in the intensive care unit using sequential subnetwork routing.

    Get PDF
    OBJECTIVE: Multitask learning (MTL) using electronic health records allows concurrent prediction of multiple endpoints. MTL has shown promise in improving model performance and training efficiency; however, it often suffers from negative transfer - impaired learning if tasks are not appropriately selected. We introduce a sequential subnetwork routing (SeqSNR) architecture that uses soft parameter sharing to find related tasks and encourage cross-learning between them. MATERIALS AND METHODS: Using the MIMIC-III (Medical Information Mart for Intensive Care-III) dataset, we train deep neural network models to predict the onset of 6 endpoints including specific organ dysfunctions and general clinical outcomes: acute kidney injury, continuous renal replacement therapy, mechanical ventilation, vasoactive medications, mortality, and length of stay. We compare single-task (ST) models with naive multitask and SeqSNR in terms of discriminative performance and label efficiency. RESULTS: SeqSNR showed a modest yet statistically significant performance boost across 4 of 6 tasks compared with ST and naive multitasking. When the size of the training dataset was reduced for a given task (label efficiency), SeqSNR outperformed ST for all cases showing an average area under the precision-recall curve boost of 2.1%, 2.9%, and 2.1% for tasks using 1%, 5%, and 10% of labels, respectively. CONCLUSIONS: The SeqSNR architecture shows superior label efficiency compared with ST and naive multitasking, suggesting utility in scenarios in which endpoint labels are difficult to ascertain

    Predicting anxiety from wholebrain activity patterns to emotional faces in young adults: a machine learning approach

    Get PDF
    BACKGROUND: It is becoming increasingly clear that pathophysiological processes underlying psychiatric disorders categories are heterogeneous on many levels, including symptoms, disease course, comorbidity and biological underpinnings. This heterogeneity poses challenges for identifying biological markers associated with dimensions of symptoms and behaviour that could provide targets to guide treatment choice and novel treatment. In response, the research domain criteria (RDoC) (Insel et al., 2010) was developed to advocate a dimensional approach which omits any disease definitions, disorder thresholds, or cut-points for various levels of psychopathology to understanding the pathophysiological processes underlying psychiatry disorders. In the present study we aimed to apply pattern regression analysis to identify brain signatures during dynamic emotional face processing that are predictive of anxiety and depression symptoms in a continuum that ranges from normal to pathological levels, cutting across categorically-defined diagnoses. METHODS: The sample was composed of one-hundred and fifty-four young adults (mean age=21.6 and s.d.=2.0, 103 females) consisting of eighty-two young adults seeking treatment for psychological distress that cut across categorically-defined diagnoses and 72 matched healthy young adults. Participants performed a dynamic face task involving fearful, angry and happy faces (and geometric shapes) while undergoing functional Magnetic Resonance Imaging (fMRI). Pattern regression analyses consisted of Gaussian Process Regression (GPR) implemented in the Pattern Recognition for Neuroimaging toolbox (PRoNTo). Predicted and actual clinical scores were compared using Pearson's correlation coefficient (r) and normalized mean squared error (MSE) to evaluate the models' performance. Permutation test was applied to estimate significance levels. RESULTS: GPR identified patterns of neural activity to dynamic emotional face processing predictive of self-report anxiety in the whole sample, which covered a continuum that ranged from healthy to different levels of distress, including subthreshold to fully-syndromal psychiatric diagnoses. Results were significant using two different cross validation strategies (two-fold: r=0.28 (p-value=0.001), MSE=4.47 (p-value=0.001) and five fold r=0.28 (p-value=0.002), MSE=4.62 (p-value=0.003). The contributions of individual regions to the predictive model were very small, demonstrating that predictions were based on the overall pattern rather than on a small combination of regions. CONCLUSIONS: These findings represent early evidence that neuroimaging techniques may inform clinical assessment of young adults irrespective of diagnoses by allowing accurate and objective quantitative estimation of psychopathology
    corecore